A terminological and ontological analysis of the NCI Thesaurus.

نویسندگان

  • W Ceusters
  • B Smith
  • L Goldberg
چکیده

OBJECTIVE The National Cancer Institute Thesaurus is described by its authors as "a biomedical vocabulary that provides consistent, unambiguous codes and definitions for concepts used in cancer research" and which "exhibits ontology-like properties in its construction and use". We performed a qualitative analysis of the Thesaurus in order to assess its conformity with principles of good practice in terminology and ontology design. MATERIALS AND METHODS We used both the on-line browsable version of the Thesaurus and its OWL-representation (version 04.08b, released on August 2, 2004), measuring each in light of the requirements put forward in relevant ISO terminology standards and in light of ontological principles advanced in the recent literature. RESULTS We found many mistakes and inconsistencies with respect to the term-formation principles used, the underlying knowledge representation system, and missing or inappropriately assigned verbal and formal definitions. CONCLUSION Version 04.08b of the NCI Thesaurus suffers from the same broad range of problems that have been observed in other biomedical terminologies. For its further development, we recommend the use of a more principled approach that allows the Thesaurus to be tested not just for internal consistency but also for its degree of correspondence to that part of reality which it is designed to represent.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Grading glioma tumors using OWL-DL and NCI Thesaurus

Brain tumors' treatment and prognosis depend to a large extent on their grades. Grading tumors follows a set of rules that refers to domain knowledge. Developing an automatic grading system requires explicit and formal representation of the domain. The NCI Thesaurus is the major ontological resource in the cancer domain. However, the description of brain tumors and grades in the NCI Thesaurus d...

متن کامل

Construction of a Condensed Thesaurus for Building Radiology Ontology

The building of thesauri for large domains, especially for medicine, is a costly affair. However, in many domains thesauri can be constructed on an ontological basis [Wielinga , Schreiber, 2001]. We are developing an ontological information retrieval system for the retrieving of medical records from an electronic medical record system (EMR). We decided to use the UMLS as a basis for building th...

متن کامل

Methodology for CIDOC CRM based data integration with spatial data

In this paper we want to present a methodology for data integration based on the CIDOC CRM. Spatial data are included in the integration process which provides us on the one hand with the possibility to access the CRM structured data through an interactive map. On the other hand in future GIS functionalities of spatial analysis can generate new data within the ontological database that could no...

متن کامل

Lists, Taxonomies, Lattices, Thesauri and Ontologies: Paving a Pathway Through a Terminological Jungle

This article seeks to resolve ambiguities and create a shared vocabulary with reference to classification-related terms. Due to the need to organize information in all disciplines, knowledge organization systems (KOSs) with varying attributes, content and structures have been developed independently in different domains. These scattered developments have given rise to a conglomeration of classi...

متن کامل

Towards a Broad-Coverage Biomedical Ontology Based on Description Logics

We describe an ontology engineering methodology by which conceptual knowledge is extracted from an informal medical thesaurus (UMLS) and automatically converted into a formal description logics system (LOOM). Our approach consists of four steps: concept definitions are automatically generated from the UMLS, integrity checking of taxonomic and partonomic hierarchies is performed by LOOM's termin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Methods of information in medicine

دوره 44 4  شماره 

صفحات  -

تاریخ انتشار 2005